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ABSTRACT 

Influenza A (H1N1) incidence in India, which is a highly contagious acute 
respiratory disease in humans caused by type A influenza virus. This study 
employs retrospective comparative study of the data from National Centre for 
Disease Control (CDC) yearly reports from April-2010 to April-2019. The case 
fatality rates of Influenza A (H1N1) incidence in India was forecasted using 
autoregressive integrated moving average (ARIMA) models in order to builda 
predictive tool for Influenza A (H1N1) surveillance. Clearly from the study, 
lack of rainfall spread the virus more efficiently and Maharashtra stood first in 
total number of cases and deaths of Influenza A(H1N1) whereas Lakshadweep 
had no signs of disease. Further, number of Cases were reported in the year 
2015 (i.e. 28%) and 25% of cases have been reported in 2017 when compared 
to ten years data. (ARIMA) (2, 1, 3) model was selected for its minimum value 
of normalized BIC, MAPE and good R-Square among all other models. The 
model forecasted the decrease in case fatality rate of Influenza A H1N1 for 
next 10years. Thus, results indicate that, ARIMA models provide a means to 
better understand Influenza A (H1N1) incidence yielding forecasts that can be 
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used for public health planning at the national level. 
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I. INTRODUCTION 

Influenza A (H1N1 or swine flu) is a highly contagious acute 
respiratory disease caused by type A influenza virus that 
regularly causes outbreaks of influenza in humans. As of June 
2009, the CDC has identified that influenza viruses from 
different species infect pigs, thus the viruses can reassert (i.e. 
Swap genes) and new viruses that are a mix of swine, human 
and/or avian influenza viruses can emerge. There are four 
main influenza types A virus subtypes that have been 
isolated in pigs: H1N1, H1N2, H3N2, and H3N1. Most of the 
recently isolated influenza viruses from pigs, however, have 
been H1N1viruses. Influenza viruses in human populations 
are usually classified as seasonal or pandemic. Seasonal 
influenza usually causes yearly epidemics in the old, the very 
young and those with underlying medical conditions, and is 
often associated with cold periods in temperate regions. An 
influenza virus becomes classified as pandemic when a novel 
strain emerges from an animal reservoir and is not only able 
to infect humans but also spreads quickly through the 
population across the globe irrespective of the season. The 
H1N1 influenza virus causes moderate to severe respiratory 
infections. Symptoms include fever, sore throat, cough, 
headache, chills and fatigue. Severe cases include Pneumonia 
bronchitis, sinus infections and an Increase in underlying 
conditions. It is contagious and can be contracted through 
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coughing, sneezing or even talking to someone carrying it. It 
can also be contracted through mouth or nose contact with 
something contaminated|[1]. Of 18 500 laboratory-confirmed 
pandemic-associated deaths identified during April, 2009, to 
April, 2010, worldwide, less than 12% were reported from 
Africa and southeast Asia, although these regions are home 
to more than 38% of the world's population During the 2009 
pandemic, India reported 27,236 laboratory confirmed cases 
of pandemic influenza A (H1N1) with 981 deaths. In 2015, 
India reportedly had over 31,000 people infected and 1,900 
resulting deaths. Most of these cases were reported from the 
States of Rajasthan, Gujarat, Delhi, Jammu and Kashmir, 
Maharashtra, Madhya Pradesh, Telangana, Karnataka and 
Tamil Nadu with a major peak in rainy season (July-August), 
and a minor peak in winter-spring season (December- 
February)|2]. 


Forecasting the future is one the fundamental tasks of time 
series analysis. Although linear methods are useful, a 
prediction from a nonlinear point of view is one step closer 
to reality. The present study is concerned with the building 
of ARIMA models for forecasting case fatality rates of 
Influenza A (H1N1) patterns in India. 
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Il. Materials and methods 

A. Data sources: 

We obtained the data of Influenza A cases from the National 
Centre For Disease Control (Formerly National Institute Of 
Communicable Diseases) which is freely available from the 
website: https://ncdc.gov.in. The data consists of seasonal 
influenza A (H1N1) - state/UT wise number of cases and 
deaths from 2010-2019. It is a retrospective study. 


B. Statistical Analysis: 

Correlation analysis was used to check the relationship or 
association between the cases the deaths of Influenza A 
(H1N1). Year-wise case fatality rate is calculated as the ratio 
of deaths occurring from influenza to the total number of 
cases due to the same cause. Case fatality rate measures the 
severity of the disease that causes death. Thematic maps 
were used to understand the patterns of the regional 
distributions of influenza A (H1N1) cases and deaths in India 
[3,4]. 


ARIMA models offer the statistical infrastructure for 
intervention analysis for the assessment of the clinical effect 
of therapeutic interventions .A ARIMA model introduced by 
Box and Jenkins (1976) was used to describe the different 
model structure with the obvious notations (p, d, q). In this 
process, the prime task is to identify an appropriate subclass 
of models from the general ARIMA family, and construct the 
ARIMA model for the data. The model is estimated using the 
following steps: Identification, estimation, diagnostic 
checking and forecasting. The first step in developing a Box- 
Jenkins model is to determine if the time series is stationary 
or non-stationary. The autocorrelation function (ACF) and 
partial correlation function (PACF) are used to detect the 
stationary or non-stationary of time series and a test of 
significant autocorrelation coefficients using Ljung-Box Chi- 
Square test (Q - test) Augmented dickey fuller (ADF) unit 
root test might be estimated to study the stationary of the 
series [5]. 


After achieving time-series stationary the identification 
model process begins, and we mean by the using of data 
about how they are generated by time-series. The 
identification of the model is the determination of the order 
of the respective processes of the ARIMA model which can be 
achieved using ACF and PACF plots. 


The Method of least square is considered as an easiest way 
which is more widely used to estimate model parameters. 
The aim of Least Squares method is to choose parameter 
estimators that make the sum of residual squares between 
real values and the estimated values of the dependent 
variable possible minimum. Schwarz (1978) proposed 
Bayesian Information Criterion (BIC) for the selection of the 
model. Selection criterions such as R-Square, Stationary R- 
Square, Root Mean Square Error (RMSE), Mean Absolute 
Percentage Error (MAPE), and Bayesian Information 
Criterion (BIC) are used to select the best fitting ARIMA 
model [6,7,8] 


After estimating the model parameters we will carry out the 
test of appropriate quality at this stage, this estimate and the 
test of diagnosed model will be carried out to find out the 
availability of its own conditions, by the test of random errors 
(residuals) [9]. Residuals are the difference between the 
value that occurs and the value that is forecasted for a given 
time period. The estimated model and the parameters are 
checked using certain techniques. The tools used for 


diagnostic checking are: over fitting, residuals of ACF and 
PACF. 


Forecasting is the last stage of study and analysis of time- 
series models, and is the main objective of the study. The 
optimum forecasting is the estimate from the resulting error 
is very small and its variance would be a least [10]. 


C. Statistical software's: 

The above statistical analysis was carried out using the well- 
known software’s: MS-Excel, IBM SPSS Statistics 22 and Arc 
Gis version 10.2 (Demo Version). MS-Excel was used for the 
calculation of case fatality rates and construction of bar and 
pie charts showing distribution of influenza cases and 
deaths. IBM SPSS Statistics 22 proved helpful in correlation 
analysis and building of ARIMA model. Outputs of thematic 
maps showing regional distribution of influenza cases and 
deaths have been driven from Arc Gis version 10.2 (Demo 
Version). A p-value>0.0 5 was considered statistically 
significant. 


III. Results and Interpretation 


STATE WISE DISTRIBUTION OF INCIDENCES 
ASD DEATHS DUE TO HIN] - (2010 - 2019) 





A thematic map Showing the distribution of total 
number of cases and deaths with respect to state /UT 
wise from 2010-2019 of India 


Here we can notice that Maharashtra stands first in total 
number of cases and deaths of Influenza A(H1N1), followed 
by Gujarat, Rajasthan, Madhya Pradesh and Karnataka. 
Whereas Nagaland, Tripura, Sikkim has few number of cases 
reported, but there is no traces of deaths due to Influenza A 
(H1N1) and Lakshadweep had no signs of disease. The states 
that are most affected by the virus experienced huge rainfall 
deficiencies. Long-term climate change has an impact on the 
spread of H1N1 virus in a unique way. 
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Pie chart showing the distribution of deaths of 
Influenza-A (H1N1) 
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We can see that more number of deaths were reported in the 
year 2015 (i.e. 27%) and 21% of deaths have been reported 
in 2017 when compared to ten years data. Further, there 
exists a significant (p value<0.05) strong positive correlation 
between cases and deaths (r=0.871). As number of cases 
increases the number of deaths also increases. 


Table I: Calculated CFR for Influenza A (H1N1) year 
BC 





YEAR | CVR 
2010 | B55 
7011 12.43 
2012 02 
2013 13.3 
2014 73,36 
2015 7.02 
2016 [4,63 
2017 5.64 
2015 | 7.29 
2010 3.71 


From the table it is clear that proportion of deaths for a 
specified population containing H1N1 influenza cases is 
highest for the year 2014 (23.26) Further consecutive 
increase and decrease in the CFR in the upcoming years 
indicate clear fluctuations. However, CFR was found to be the 
lowest in the year 2019. 


STATE WISE DISTRIBUTION OF TOTAL CFR 
OF HIN] - (2010 - 2019) 





A thematic map of India showing average CFR of Influenza A 
(H1N1) 


The above map shows that Meghalaya, Dadra & Nagar Haveli, 
Chhattisgarh, Manipur, Daman & Diu, Arunachal Pradesh, 
Punjab, Madhya Pradesh, Himachal Pradesh and Odisha have 
the highest distribution of total CFR, followed by Andaman & 
Nicobar, Maharashtra, Uttarakhand, Jharkhand, Chhattisgarh 
having moderate distribution. Tamil Nadu, Bihar, Delhi, 
Lakshadweep have lowest total CFR distribution among 
theother. 


The case fatality rates calculated for 10 years are used for 
model development and to evaluate forecast performance. A 
graph is plotted to check the stationarity. The data is found to 
be no stationary, shown in Figure IV 


@IJTSRD | Unique Paper ID - IJTSRD39862 _ | 


Volume-5|Issue-3 | 


Figure IV: A sequence plot showing the trend line for the 
case fatality rate of the Influenza A (H1N1) from 
2019Therefore, the first order differencing of the series was 
carried out. The obtained differenced data was examined for 
stationarity in mean by computing the autocorrelation and 
partial autocorrelation coefficients (ACF and PACF). After 
examining Table IJ, Figure V and VI, it was concluded that it 
was Stationary in mean. 


Table II: ACF and PACF of CFR 
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ACF and PACF of differenced CFR 

Various orders of ‘p’ and ‘q’ were tired with the difference of 
one (d=1) and set 9 models with structure close to the 
tentative candidate model to allow the possible identification 
errors (Table III). This illustrates the importance of having 
many identification procedures so that the proper model is 
not missed just because one of the identification methods 
does not explicitly portray the best model tofit. 


Table III: Suggested Models 


sri MAPE NBIC 


$32 4.34 
* LET 4.774 


| 0.408 7.002 46.81 | 4.381 


stationary R-square 


40.425 ) 4.735 
49,44 3.162 





























(2,13) | |. 0.500 9,737 O17 = | 40.665 | 4.017 


Parameter estimation: It is observed that the model structure 
of (2, 1,3) have minimum value of normalized BIC, MAPE and 
good R-Square. Therefore we have selected ARIMA (2, 1, 3) 
model as a candidate model for the inflow series and 
parameters of selected model are given in Table IV. 


Table IV: Model statistics of ARIMA 
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Table V: Predicted values. 
Year Actual values Predicted values LCL UCL 


2010; 855 | 
2020; | 843 | 19.61 | 36.5 | 
2021; | 7.038 | 20.45 | 34.5 | 
2022; | 6.08 | 24.84 37 | 
2023; | 582-2588 | 37.5 | 
2024; | 556 | -25,98 | 37.1] 
2025; | S| -26,29 | 36.3 
2026| | 4.86 | 26,92 | 35.6 | 
2027, | 8.82 2.7.63 | 35.3 
2028] | 8.85 | 28,06 | 34.8 | 
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Showing residuals of ACF and PACF 

The mean CFR forecasted values for next 10 years have been 
calculated and the results are presented in Table 7, along 
with the observed records. From the figure 15, the projection 
shows a decreasing trend of case fatality rates of InfluenzaA 
(H1N1) after 2019 till 2029, and it is observed that the 
forecasted values are closer to that of the actual values and 
fall within the confidential limits (95%). 
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Forecast model of Case Fatality Rate 


IV. Discussion 
The respiratory infection popularly known as swine flu is 
caused by an influenza virus. An H1N1 subtype pandemic 


occurred in 1918, causing millions of deaths. H1N1 
reappeared in humans in 1976, and more recently in 2009.It 
was first recognized in spring 2009, near the end of the usual 
Northern Hemisphere flu season. The new virus, 2009 H1N1, 
spreads quickly and easily. A few months after the first cases 
were reported, rates of confirmed H1N1-related illness were 
increasing in almost all parts of the world. WHO declares on 
June 11, 2009, that H1N1 (Swine-influenza A) is pandemic. 
There have been nearly 30,000 confirmed H1N1 cases across 
74 countries [11]. It has been noted that around 2% of 
infected individuals in some countries have developed 
severe disease, and that this has occurred most commonly in 
those aged 30-50 years. In March 2009, novel 2009 
influenza A (H1N1) was first reported in the south-western 
United States and Mexico. The population and health care 
system in Mexico City experienced the first and greatest 
early burden of critical illness. Critical illness occurred in 58 
of 899 patients (6.5%) admitted to the hospital with 
confirmed, probable, or suspected 2009 influenza (A) H1N1. 
Critical illness from 2009 influenza A (H1N1) in Mexico 
occurred in young individuals, was associated with severe 
acute respiratory distress syndrome and shock, and had a 
high case-fatality rate. Around 12, 3397 people have been 
tested in India as on February 1, 2010. In India, 23.3% of 
people who have tested for swine flu are found suffering 
from swine flu. Also around 4% of people who have tested 
positive for swine flu have died and could not be saved in 
India. Maharashtra with 317 deaths has seen maximum 
deaths from swine flu in India. Gujarat also has 242 deaths 
from swine flu till February 1, 2010. Indian states suffering 
from are Rajasthan (176 deaths), Karnataka (141 deaths), 
and Delhi (93 deaths).Delhi has the highest number of 
patients who are tested positive for swine flu, with 9,652 
people tested positive. Maharashtra has 5,116 cases of swine 
flu affected patients and Rajasthan has 2,101 patients who 
were tested positive for swine flu till February 1, 2010. The 
states which are severely affected by swine flu have high 
exposure to foreigners!.The new influenza A (H1N1) virus is 
genetically related to swine flu viruses, and differs 
substantially from circulating human influenza viruses. It is 
able to spread from person to person. Because it is a 
completely new virus, there is probably little immunity in 
the population [12]. This pandemic of influenza A (H1N1) 
has shown mild to moderate severity. Overall case-fatality 
(CFR) estimates ranged from 0.054-0.086 per 1000 persons 
with nH1N1-associated ILI and were highest for persons=65 
years (0.094-0.147 per 1000) and lowest for those 0-17 
(0.008-0.012).Hygiene measures and administration of 
antiviral drugs to patients and their contacts may slow the 
spread [12]. 


Forecasting the dynamics of influenza outbreaks could be 
useful for decision-making regarding the allocation of public 
health resources. Reliable forecasts could also aid in the 
selection and implementation of interventions to reduce 
morbidity and mortality due to influenza illness [13]. For 
diseases that show recurrent seasonal patterns or occur in 
cyclic patterns, time series models are the most widely used 
Statistical models by health researchers for forecasting [14]. 
Time series forecasting is commonly performed using 
autoregressive integrated moving average (ARIMA) models 
M4that can accommodate both trend and seasonal variations. 
ARIMA models are typically selected by maximizing some 
measure of predictive accuracy. A study was conducted to 
identify circulating influenza virus strains and vulnerable 
population groups and investigate the distribution and 
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seasonality of influenza viruses in Ningbo, China. Then, an 
autoregressive integrated moving average (ARIMA) model 
for prediction was established. Influenza surveillance data 
for 2006-2014 were obtained for cases of influenza-like 
illness (ILI) (n= 129,528) from the municipal Centres for 
Disease Control and virus surveillance systems of Ningbo, 
China. The ARIMA model was proposed to predict the 
expected morbidity cases from January 2015 to December 
2015. Of the 13,294 specimens, influenza virus was detected 
in 1148 (8.64%) samples, including 951 (82.84%) influenza 
type A and 197 (17.16%) influenza type B viruses. ARIMA (1, 
1, 1) (1, 1, 0)12 model could be used to predict the ILI 
incidence in Ningbo. The seasonal pattern of influenza 
activity in Ningbo tended to peak during the rainy season 
and winter. Given those results, the model we established 
could effectively predict the trend of influenza-related 
morbidity, providing a methodological basis for future 
influenza monitoring and control strategies in the study area 
[15]. 


4.1. Limitations 

Missing observations are as frequent as variable time 
intervals between measurements. While regular time 
intervals can be accomplished by adaption of methods, 
missing values have to be bridged through interpolation or 
by the application of a Kalmanfilter. A limited number of data 
points can mean a Serious restriction to the applicability of 
time series analysis. The number of available measurements 
can be compromised both by the critical state of the 
respective patient and the thus limited time for 
measurement, and by methodological and organizational 
problems (e.g. laboratory). From a conservative point of 
view, more than 50 observations are needed for a reliable 
estimation of an ARIMA model. But more recent simulation 
experiments show that a number of 20 data points can be 
sufficient. 


V. Conclusion 

This study was able to analyse the variations comprising the 
case fatality rates of the Influenza A (H1N1) The 
formulation, estimation and forecasting of time series 
ARIMA model was built from a pragmatically point of view 
based on the available Influenza A (H1N1)data. The results 
indicate that the proposed ARIMA (2, 1, 3) model can 
forecast with a satisfactory goodness of fitand performance 
with the precision. The model was selected for its minimum 
value of normalized BIC, MAPE and good R-Square among all 
other models. Application of this model, could successfully 
forecast the decrease in the case fatality rate of Influenza A 
H1N1 for next 10 years. This model is recommended to the 
metropolitan health directorate and re-searchers who would 
want to monitor the incidence of Influenza A (H1N1) 
outbreaks in the metropolis and other parts of the world. 
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